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Abstract 

Consider a sphere of radius y/n in n dimensions, and consider X, a ran- 
dom variable uniformly distributed on its surface. Poincare's Observation 
states that for large n, the distribution of the first k coordinates of X is 
close in total variation distance to the standard normal iV(0, In this 
paper, we consider a larger family of manifolds, and X taking a more gen- 
eral distribution on the surfaces. We establish a bound in the stronger 
Kullback-Leibler sense of relative entropy, and discuss its sharpness, pro- 
viding a necessary condition for convergence in this sense. We show how 
our results imply the equivalence of ensembles for a wider class of test 
functions than is standard. We also deduce results of de Finetti type, 
concerning a generalisation of the idea of orthogonal invariance. 

1 Notation and Definitions 

Diaconis and Freedman |6| consider X, a random variable uniformly distributed 
on the surface of a sphere of radius \fn in n dimensions. They show that the 
distribution of 7r n &X, the first k coordinates of X, is close in total variation 
distance to a Gaussian for large n. They indicate a natural connection between 
this problem and the equivalence of ensembles for the free Hamiltonian, where 
a sphere corresponds to a surface of constant kinetic energy. In this paper, 
we consider a more general family of manifolds defined by symmetric, additive 
Hamiltonians. Unlike Diaconis and Freedman we shall not assume a uniform 
distribution on these surfaces. We prove convergence in the stronger Kullback- 
Leibler sense of relative entropy distance. 
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First, we introduce notation. Given measurable / : R — > 1 U {00} we de- 
fine F to be the set on which / is finite, and let R n = R n (f) be given by 
.Rn(x) = YH=i f( x i)i We define the surface S n (t) = {x : i? n (x) = nt} C 
F n . Given / and c > 0, define g nfi for the density of a Gibbs distribution: 
<? n ,c(x) = exp(— cRn(x))/Z™. We assume that / has the property that Z c = 
J exp(—cf(x))dx is finite for all c > 0. Define the projection ir n k '■ K™ — > K fc 
restricting to the first coordinates: vr n fc (a;i, x 2 , . . . , i„) = (xi, x 2 , . . . , For 
any probability density q on IR n write the energy e(q) = J q(x)R n (x)dx and the 
entropy h(q) = f — g(x) logg(x)dx. 

For a given /, we know that Gibbs densities maximise the entropy for a given 
energy. (This characterisation of Gibbs measures via a variational principle is 
discussed in Chapter 15 of Georgii [0]). This is done by considering the Kullback- 
Leibler distance D(p\\g niC ), where D(f\\g) = f /(x) log(/(x)/#(x))dx, for / and 
g probability densities on IR fc . 

Example 1.1 Two cases in particular are significant here: 



1. If f(x) = x 2 , then Z c = sf^Jc, and # n , c (x) = \Xa=x 9i,c( x i)> wh ere g 1>c is 
a N(0, (2c) _1 ) density, the density which maximises entropy subject to a 
variance constraint. 

2. If f{x) = x for x > and f(x) = 00 for x < 0, then Z c = 1/c, and 
<?n,c( x ) = nT=i 9i,c( x i)> where gx jC is an Exp(c) density, the density which 
maximises entropy on the positive half-line subject to a mean constraint. 

Now we can state Diaconis and Freedman's results on the normal and exponential 
case in the form: 

Theorem 1.2 For f as in Example |i.4 ^| or and for a given t G (0, 00), 

take c such that e(<7i,c) — t and let X be distributed uniformly on the surface 
<S n (t) Q F n . Writing p n ,k,t{y) f or the density of 7r n> kX.: 
For f(x) = x 2 , as in Example [TH-IH, d>Tv(Pn,k,t, 9k,c) < ^{k + 3)/(n — k — 3). 
For fix) = x, as in Example [0].[§, c? T y(p n)fcii X, g k>c ) < 2(k + l)/(n — k — 1). 

The following theorem is the main result of this paper. It involves a class T 
of functions (we postpone the precise definition to Definition |2.2|, but roughly 



speaking we want / to be strictly increasing and well-behaved at zero). Let h n j 
be the density corresponding to uniform distribution on S n (t). 

Theorem 1.3 Assuming f G T ' , for any t G (0, 00) take c such that £(<7i, c ) — t 
and consider X, a random variable with density p on S n (i). Writing p n ,k,t{y) f or 
the density o/7r n> feX, for some constant C = C(f): 

( n \ 9 
D(p nAt \\g k ,c) < D(p\\h n ,t) + log + 



n — k J sfnjC — 1 
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A major motivation for this work, beyond the intrinsic interest of generalising 
Diaconis and Freedman's work |J, comes from the question of so-called 'equiv- 
alence of ensembles'. Given a Hamiltonian H(x), for x 6 IR n we can consider 
the microcanonical ensemble (uniform measure with density h n t on the surface 
{x : H(x) = nt}) and grand canonical ensemble (Gibbs measure with density 
g n>c proportional to exp(— cH(x)) for x e R n ). The principle of equivalence of 
ensembles suggests that in some sense h n> t and g nfi are close together, that is 



for some class of test functions p. Convergence in total variation distance (as 
established by Diaconis and Freedman) implies that Equation (U) holds for p 
bounded and continuous and depending only on k = o(n) coordinates. Theorem 
|1.3| implies this for a wider class of test functions, including the Hamiltonian 



for p depending only on k coordinates, if p(x\, . . . ,Xk) is bounded above by a 
multiple of 1 + Yli=i f( x i)- Here, k need not be fixed, and can grow as o(n). 

Proof This is a consequence of Theorem |1.3| in conjunction with Lemma 3.1 
of Csiszar 0. The latter states that D(u n \\v) — > implies J p(x)u n (x)dx — > 
j p(x)v(x)dx, for any p such that j exp(tp(x))v(x)dx < oo for small \t\. This 
integral is seen to be finite by definition of the partition function. □ 

A second application is to results of de Finetti type, as described in |J. For 
example, define an infinite sequence Xi,X2,... of random variables to be or- 
thogonally invariant if for any n, the law of (X\, . . . ,X n ) is invariant under 
orthogonal transformations of R™. Schoenberg ]I2] states that all orthogonally 
invariant distributions are mixtures of normals. Diaconis and Freedman || prove 
this by showing that the first k of n orthogonally invariant variables are within 
2k /n of a mixture of normals, so a passage to the limit provides the infinite result. 
In a similar way we can consider /-invariant measures; that is, sequences of 
random variables such that for any n, the law of {X\, . . . , X n ) is invariant under 
continuous transformations of W 1 which preserve R n (x\, . . . ,x n ). We show: 

Corollary 1.5 For f 6 T , the only f -invariant measures P are mixtures of 
Gibbs measures. That is, if we write Goo iC for the distribution of Yi,Y 2 , . . . , 
where Yi are independent with density gi jC , there exists a measure X, valued on 
c > 0, such that 




(1) 



itself. 



Corollary 1.4 Assuming f G T , for any t e (0, oo) 








P 
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Proof First we consider finite subsequences of X±, .... If measure P n is invariant 
under i? n -preserving transformations then (as in ||), P n is constant on each 
manifold S n {t), so: 

P n = J h nt tfJ, n {t)dt, 

for some measure probability measure Now projecting down to the first k 
coordinates: 



Writing c(t) for the unique c such that £(gi, c ) = t, and defining Rk = J gk^^nitydt, 
then: 



\Rk ~ Qk\\rv 



< J Vn(t)\\(ir n ,khn,t) - 9k,c(t)\\dt < \J 



2k 

n — k' 



by Theorem |1.3| . 

Duplicating Diaconis and Freedman's tightness argument we can show that as 
n — > oo, fi n must have a convergent subsequence, with limit fi. Mapping from t 
to c(t), we find a A with the required properties. □ 

The fact that projecting a uniform distribution on a sphere approximately gives 
a Gaussian is often referred to as Poincare's Observation (see for example ||10|| ). 
However, in Section 6 of their paper, Diaconis and Freedman suggest that this 
attribution is wrong, and that the earliest reference to it in the probability liter- 
ature comes in the work of Borel. Nonetheless, it appears that the observation is 



even older than this, and can be traced back to Mehler [|TT |. 
Csiszar || uses entropy-theoretic methods to consider the distribution of a ran- 
dom variable Xi, conditional on the vector X = (X 1; . . . ,X n ) G A for some 
set A. However, these results rely on A being a 'thick set' of positive measure, 
whereas in the present paper we consider the so-called 'thin shell' case. Dembo 
and Zeitouni || extended Csiszar's results to the distribution of the first k co- 
ordinates, where k = k(n) can vary with n, and (as in this paper) discovered 



that a sufficient condition for convergence is that k{n) = o{n). Schroeder |L3 
generalised this result to Markov processes, and Comets and Zeitouni even to 
mean-field perturbations of Markov processes. However, all these papers use the 
assumption that A has non-empty interior. This corresponds to a weak limit the- 
orem, bounding the probability of a particular set, whereas our methods require 
a local limit theorem, bounding densities. 



2 Class of Functions Considered 

It is natural to ask for the widest possible class of functions / such that results 
such as Theorem |1.2| hold. For technical reasons, we will need to control w n , the 
density of _R n (X), when X has Gibbs density g n>c . Specifically we need an upper 
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bound on \og(w n -k(s) / w n (t)) for certain s,t. Theorem p7T2] holds because w n is 



known exactly (in Example [TTT|.[T] i? n (X) is the sum of squares of normals, with 
the Xn distribution, and in Example |1.1| .|2| i? n (X) is the sum of exponentials, with 
the r(n, c) distribution). A later paper by Borovkov [|TJ extends Diaconis and 
Freedman's work to the case f(x) = \x\ p , again using exact calculations of the 
density. Whilst his bounds are tighter, the method will not extend to the general 
case - it gives no information about Hamiltonians of the form f(x) = x 2 + ex A . 
However, we can make progress in other cases too. The key observation is that 
Xi are IID (with marginal density <7i, c ), so -R n (X) = Y^7=i /(-^») ^ s a sum °f n 
IID random variables. Using a local version of the Central Limit Theorem, we 
will be able to show that the densities are sufficiently close in supremum norm 
to a Gaussian density for our result to go through. We obtain a proposition 
reminiscent of Equation (2.9) of 0. 

Proposition 2.1 Consider X with density g n c and let Yj = f(Xj), with KYj = 
fi, Var(Yj) = a 2 , and characteristic function (f>(u) = Eexp^wY}). Assume that 
I = J \<f)(u)\ r du is finite for some r > 1 and m = E|Y — /i| 3 is finite. If Wk is the 
density of Rk(X.) = Yli=i ^ then there exists a constant C = C{Y) such that for 
n — k > r: 

w n - k (z) \ . , f n \ 2 



log in^ - log (w=k r /n/c-i for allzER - 

Proof The local limit theorem tells us that there exists a constant C(Y) such 
that if /„ is the density of Yl(Xi ~~ A 4 )/ Vna 2 then for n > r: 



sup 



f n (x) -I=exp(-x 2 /2) 

v 2tt 



< C(Y) 



V2 



A careful reading of (for example) Section 46 of Gnedenko and Kolmogorov 
[Q shows that the dependence of C(Y) on Y comes through m,a 2 ,I and v = 
sup t><T 2 /m \<p{t) \ < 1. 
Rescaling this, we know that for any z: 

Vn-k + C v^-C 
w n -k{z) < and w n {nn) > 



(n — k)\ph^o 2 ' nv2TTP 
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Taking the ratio of these terms we deduce the result, since: 



log| v^z±^<a 1+ 2c 



□ 



We can now describe the class of surfaces that our Theorem will cover. These 
conditions on function / are chosen so that they imply that the local limit theorem 
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will hold in Proposition EO. We confirm this in Lemmas [2.3| and [2.4| below. 



Roughly speaking, we require the function / to grow faster than a linear function 
on most of the domain, and to be well-behaved at 0. 

Definition 2.2 Define T to be the class of functions / :1->RU with: 

1. /(0) = ; f(x) right continuous at 0. 

2. f(x) is differentiable for x > with f'(x) > and: 

(a) There exist a\,a 2 such that f'(x) > a± > for x G (a 2 , 00). 

(b ) There exist 1 < q < 2 and a 3 > such that: 

hm mt — > a 3 . 

x\o /(rr) 

5. Either (a) f(x) = 00 /or x < or (b) f(x) = f(—x) for all x. As before, 
we write F for the interval on which f is finite. 

Notice that for any p > 1, picking f(x) = x p on x > and infinity elsewhere, 
or f{x) = \x\ p everywhere ensure that / G T. Thus the cases considered by 
Diaconis and Freedman and Borovkov [|IJ are included in our theorems. 
Conditions 1 and 2 ensure that Z c is finite and non-zero for c G (0, 00), since then 
/ exp(-cf(x))dx <a 2 + f(x) exp(-c/(x))/ai < a 2 + ex.p(-cf(a 2 ))/ca 1 , and 
continuity provides boundedness away from zero. 

Lemma 2.3 Assume f G JF. //X /ias density gx )C (x) = exp(—cf(x))/Z c and 
Y = f(X), then m = E\Y - EY\ 3 is finite. 

Proof In case ^(a) and |^(b) of Definition [2.2| f(X) will have the same density, 
hence we need only consider case |](a). Further, note that in this case, since Y > 0, 
m = E(y — /x) 3 , the 3rd cumulant. Since the moment generating function of Y is 
M(t) = Z c _ t /Z c , the cumulant generating function logM(t) = \ogZ c _ t — logZ c , 
and the result follows. □ 



Lemma 2.4 If f G T , then if X has density gi, c {x) = exp(—cf(x))/Z c and Y = 
f(X), with characteristic function <fi, then there exists r such that I — J \cf){u)\ T du 
is finite. 

Proof The density g(y) of Y is Z^ 1 exp(— cy)//'(/ _1 (y)). As described in for 
example Theorem 74 of [jH}| , if g G L p , for some 1 < p < 2 then I is finite for 
r > p/(p — 1). By Condition 2B of Definition 2.2, for y close to zero g(y) < 



const. exp(— cy)y 1 ^ q , so picking p > q ensures the finiteness of the integral at 0. 
Condition pa] gives us control as y — ► 00. □ 
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3 Nesting and Projection of Surfaces 



Recall that we want to project a density p from a manifold S n (t) onto its first 
k coordinates. Given p, we will first create (6p), a density on F n , by placing 
weighted copies of p on each manifold S n (u). One has a free choice of how to 
weight the individual manifolds. The picture is that of fitting together an infinite 
set of 'Russian dolls', each of different sizes, but each with the same pattern of 
densities on their surface. We then consider the projection of p by considering 
the projection of (9p) conditional on being on S n (t). 

By a suitable choice of weighting, we can make the projection well-behaved. For 
example, in the case of the uniform density on the sphere, we weight concentric 
spheres by the x 2 distribution to produce the normal density on W 1 . 
We need to develop a coordinate system that allows us to describe a point x in 
space by giving its distance from the origin and the point where the line from 
to x crosses S n (t). In the case of the sphere f(x) = x 2 , this corresponds to 
transforming between rectangular and polar coordinates. We shall require one 
further technical lemma, not proved here: 

Lemma 3.1 For any t e (0, oo), the equation s(gi tC ) = t has a unique solution 
as an equation in c, if f(0) = 0, f is right continuous at zero, increasing on 
x > 0, f(x) — > oo as x —> oc, and f(x) = oo for x < 0. 

More formally, we give a C 1 -foliation of _F n \{0} by compact (n — l)-dimensional 
manifolds S n (u), u > 0. Each manifold is endowed with a standard Riemannian 
metric and diffeomorphic, by a central projection, to either a unit sphere centered 
at the origin or to its intersection with a non- negative orthant. 

Proposition 3.2 For given t, under Conditions 1 and 2 of Definition \2.2\ there 
exists a bisection $> t between {x : x > 0} x S n (t) and F n \ {0}. The bisection $> t 
has a Jacobian A n j which is positive everywhere. 

Proof For given u and for any x G F n \ {0}, under Conditions 1 and 2 of 
Definition |2.2| , the equation R n (kx) = u has a unique solution in k > 0, by the 



Intermediate Value Theorem. Hence given (r, s), where r > and s e S n (t), we 
can find a unique x = ks, where k is chosen such that R n (ks) = nr. Conversely, 
given x, we can define the central projection S n} t(x) = /cx, where k is chosen such 
that R n (kx.) = nt, which is equivalent to saying that fcx 6 S n (t). We take the 
pair (r, s) = (_R n (x), SVm(x)). 

This foliation induces a coordinate system on F™\{0} via the bijection $ t . Let 
o n ^ t denote the induced Riemannian volume on S n (t). The Jacobian y4 ni (x) = 
A n ^(t, s) is determined since for any measurable function /: 



/ f(t,s)<S> t (dt,ds) = I f(R n (x),S n , t (x))dx 

f(t,s)A n>t (t,s)dta nit (ds). 



-xS n (t) 
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The positivity of /' ensures the positivity of A njt , since the local structure of 
the above foliation (and the induced map $ t ) may be also described as follows. 
Given a point x, such that 3>t(x) = (R n (x) , S n (x)) , for small 5x, we calculate 
$i(x + 5x). Now i? n (x + 5x) = i2n(x) + tfa^/'fo) + 0(5x 2 ), which is not 
identically equal to -R n ( x )- Similarly, if S n (x) = fcx e S n (t), then S^x + 5x) is 
— e)(x + #x), where e is chosen such that this lies on S n (t). The choice of 
e that achieves this is e = f'(xi)fixi)/(52 f'(%i) x i), which again ensures that 
S n (x + <5x) is not identically equal to 5„(x). □ 

Now having developed our coordinate system, we can describe the map 9 which 
takes a density on S n (t) and gives a density on F n . The motivation for this 
definition is that it gives an isometry between densities (see Lemma 



Definition 3.3 Given a probability density p on S n (t), we can define the product 
density {w n x p) on R + x S n (t), where w n is the density of R n (X) when X has 
density g nc . We can thus define the density {Op) induced by $ t on F n \{0}, since 
transforming to Cartesian coordinates, we know that: 

A nj f(i? n (x),6 nit (x)) 
which is by construction a probability density on F n \ {0}. 

The one significant difference, as mentioned by Borovkov, is that we will no longer 
consider the uniform distribution on the surface but rather consider /i ni t(v) = 
A n>t (nt,v)/ (J A nt t(nt,s)a n)t (ds)) for v e S n (t). 

Lemma 3.4 With the definitions above: D(6p\\g nc ) = D(p\\h n j). 

Proof Note that w n is characterized by: 



r 



w n (t)dt = J 5(„ >c (x)I(i? n (x) < r)dx 

= J exp(-ct)Z- n I(t <r)(J A n>t (nt, s)<7 n , t (ds)J dt. 

We deduce that for any u e F n , g n;C (u) = w n (R n (u))/ (J A n>t (nt, s)a n>t (ds)) . 
Hence by definition, (9h ntt )(u) = w n (R n (u))/ (f A ntt (nt,s)a n)t (ds)) = #„, c (u). 
This means that 

D(6p\\g n , c ) = D(6p\\6h n , t ) = J e P (x)\og ( ^g^fl) ) d * 
as required. □ 



S 



Definition 3.5 Given t > and a density p on S n (t), write p n ,k,t(y) f or the 
density of p projected by ir n ^ onto M fc . 

The key observation is that p n ,k,t(y) — Qn,k(y, nt)/w n (nt), which is the projection 
of {dp) on M fc conditioned on R n = nt. Here q n , k (y,nt) may be treated as the 
joint density of 7r nj fcX = y and i? n (X) = nt, where X is a random point of F n 
with density 9p. We thus establish the principal result of this paper: 



Proof of Theorem 1.3 We can write: 



D{Pn,k,t\ 9k,c) = / — nv~ g — r~\ r~ + — p ? \ \ d y 

J w n {nt) \gkAy> w n~k{nt-Rk{y)), 

f / \ i ( w n - k (nt - R k (y))\ , 

J V w n (nt) J 



Proposition |3.6| below deals with the first term. Lemmas [2.3| and |2.4| imply 
Proposition pT| , providing a bound uniformly in z on \og(w n ^k(z) / w n {nt)) , so we 
can deal with the second term. □ 

An analogue of the discrete log-sum inequality holds for integrals, and hence we 
deduce a projection inequality: 

Proposition 3.6 Under the previous definitions: 

qnAy ' nt) log ( , J^y^L_ ) dy < D{pl]hn y 



w n (nt) \g kjC (y) w n-k(nt - R k (y)) 

Proof Given integrable functions g(x), h(x), normalising to get probability den- 
sities p(x) = g(x)/ J g(x)dx, q(x) = h(x)/ J h(x)dx, the Gibbs inequality gives: 

f f f g( x )\ ( f \ ( \ g(x)dx 

< D(p\\q) I g(x)dx = / g(x) log I J dx — I / g(x)dx ) lor 1 



h(x) ) \J J \f h(x)dx 

Now, writing x = (y, u), where y G M fc , u G IR n_fc , notice that: 

q n ,k{y,nt) = J(9p)(y, u)I(R k (y) + R n „ k (u) =nt)du 

gk, c {y) w n-k{nt - R k {y)) = J {9h n>t ){y, u)I(R k (y) + R n - k {u) = nt)du 



Hence we deduce that for each y: 

q n ,k{y,nt) ( q n ,k(y^ nt ) 

log 



w n {nt) \gk,c(y) w n-k(nt - R k (y)), 

< / MCLSl , og ( ) m{y) + ^_ l( u) = nt)4u 

J w n {nt) \(9h n> t)(y,u)J 

= J p(S n (y, u)) log ( /^y^) ) KRk(y) + Rn-k(n) = nt)du 
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Integrating with respect to y, we obtain: 



w n (nt) \9k,c(y) w n-k(nt - Rk(y)). 
= J p(S n (y, u)) log ( ^s^u)) ) I(jRfc(y) + Rn ^ u) = nt)dudy 

= /^ (x))log (£^i)) I( ^ (xH ^ x 

as required. □ 



Corollary 3.7 Assume f & J 7 and taking t = e(g liC ), given the density p = h„ ;t 
on S n (t), then the projection p n> k,t satisfies: 

D( PnA t\\9k,c) < log (j^) + -^Jc^l' 

Observe that dTv(f,g) = / 1/0*0 — g(x)\dx < y/2D(f\\g) (see [[|), and hence in 
this case then the rate of convergence in total variation distance is 0(1/ y/n), as 
opposed to the 0(1 /n) which Diaconis and Freedman establish. This difference 
can be attributed to the fact that we approximate the densities w n , rather than 
being able to obtain exact bounds on them. 



4 A converse 



Given the density p = h n j on S n (t) we can see from Corollary [T7|that as n —>■ oo, 
and k/n — > then D(p n ^j\\9k,c) —> 0. This is also Diaconis and Freedman's |§ 
necessary and sufficient condition for convergence in total variation distance in 
the spherical case. We show that this condition holds for more surfaces than that: 



Proposition 4.1 Given f E T , consider the uniform probability density h n t on 
S n (t), and consider p n ^ }t , the distribution of its first k coordinates. If p n ,k,t — * 
gk, c in total variation distance then k/n — > 0. Hence the stronger result of 
D(Pn,k,t\\9k,c) implies that k/n — > 0. 

Proof Now considering Y with density g n ^ c , by conditioning and independence, 
the density rk(y) of (Rk(n ny kY)\R n (Y) = nt) satisfies: 

w k (y)w n -k(nt - y) 
w n (nt) 
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Hence since total variation distance is reduced by projection, we deduce that for 
any set L: 



d T v{Pn,k,t, 9k,c) > d T v{rk,w k ) > 2 f w k (y) [ n fc ^ ^ - 1 ) dy. 

Jl V w n {nt) J 

Now, using the estimates of the previous section, we know that choosing the 
interval L — (kt — e\/n — k, kt + ey/n — k) will ensure that the term in brackets 
is close to yjn/n — k — 1. Furthermore for n» k, Wk(L) will be close to 1, so we 
deduce the result. □ 
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